SEQUENCE LISTING



<110> Strathmann AG & Co. 

<120> Virus-Vaccine 

<130> P057760 

<140> 
<141> 

<150> 199 07 485.2 

<151> 1999-02-12 

<160> 12 

<170> PatentIn Ver. 2.1

<210> 1

<211> 9709 

<212> DNA 

<213> Human immunodeficiency virus 



<400> 1

tggaagggct aatttggtcc caaaaaagac aagagatcct tgatctgtgg atctaccaca 60

cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac 120

tgacctttgg atggtgcttc aagttagtac cagttgaacc agagcaagta gaagaggcca 180

aataaggaga gaagaacagc ttgttacacc ctatgagcca gcatgggatg gaggacccgg 240

agggagaagt attagtgtgg aagtttgaca gcctcctagc atttcgtcac atggcccgag 300

agctgcatcc ggagtactac aaagactgct gacatcgagc tttctacaag ggactttccg 360 

ctggggactt tccagggagg tgtggcctgg gcgggactgg ggagtggcga gccctcagat 420

gctacatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga 480 

gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 540 

tgagtgctca aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 600

agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 660

cgaaagtaaa gccagaggag atctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 720 

caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga 780 

aggagagaga tgggtgcgag agcgtcggta ttaagcgggg gagaattaga taaatgggaa 840 

aaaattcggt taaggccagg gggaaagaaa caatataaac taaaacatat agtatgggca 900 

agcagggagc tagaacgatt cgcagttaat cctggccttt tagagacatc agaaggctgt 960 

agacaaatac tgggacagct acaaccatcc cttcagacag gatcagaaga acttagatca 1020 

ttatataata caatagcagt cctctattgt gtgcatcaaa ggatagatgt aaaagacacc 1080

aaggaagcct tagataagat agaggaagag caaaacaaaa gtaagaaaaa ggcacagcaa 1140 

gcagcagctg acacaggaaa caacagccag gtcagccaaa attaccctat agtgcagaac 1200 

ctccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaatgc atgggtaaaa 1260 

gtagtagaag agaaggcttt cagcccagaa gtaataccca tgttttcagc attatcagaa 1320

ggagccaccc cacaagattt aaataccatg ctaaacacag tggggggaca tcaagcagcc 1380 

atgcaaatgt taaaagagac catcaatgag gaagctgcag aatgggatag attgcatcca 1440 

gtgcatgcag ggcctattgc accaggccag atgagagaac caaggggaag tgacatagca 1500 

ggaactacta gtacccttca ggaacaaata ggatggatga cacataatcc acctatccca 1560 

gtaggagaaa tctataaaag atggataatc ctgggattaa ataaaatagt aagaatgtat 1620

agccctacca gcattctgga cataagacaa ggaccaaagg aaccctttag agactatgta 1680 

gaccgattct ataaaactct aagagccgag caagcttcac aagaggtaaa aaattggatg 1740 

acagaaacct tgttggtcca aaatgcgaac ccagattgta agactatttt aaaagcattg 1800 

ggaccaggag cgacactaga agaaatgatg acagcatgtc agggagtggg gggacccggc 1860 

cataaagcaa gagttttggc tgaagcaatg agccaagtaa caaatccagc taccataatg 1920 



atacagaaag gcaattttag gaaccaaaga aagactgtta agtgtttcaa ttgtggcaaa 1980

gaagggcaca tagccaaaaa ttgcagggcc cctaggaaaa agggctgttg gaaatgtgga 2040

aaggaaggac accaaatgaa agattgtact gagagacagg ctaatttttt agggaagatc 2100

tggccttccc acaagggaag gccagggaat tttcttcaga gcagaccaga gccaacagcc 2160

ccaccagaag agagcttcag gtttggggaa gagacaacaa ctccctctca gaagcaggag 2220

ccgatagaca aggaactgta tcctttagct tccctcagat cactctttgg cagcgacccc 2280

tcgtcacaat aaagataggg gggcaattaa aggaagctct attagataca ggagcagatg 2340

atacagtatt agaagaaatg aatttgccag gaagatggaa accaaaaatg atagggggaa 2400

ttggaggttt tatcaaagta ggacagtatg atcagatact catagaaatc tgcggacata 2460

aagctatagg tacagtatta gtaggaccta cacctgtcaa cataattgga agaaatctgt 2520

tgactcagat tggctgcact ttaaattttc ccattagtcc tattgagact gtaccagtaa 2580

aattaaagcc aggaatggat ggcccaaaag ttaaacaatg gccattgaca gaagaaaaaa 2640

taaaagcatt agtagaaatt tgtacagaaa tggaaaagga aggaaaaatt tcaaaaattg 2700

ggcctgaaaa tccatacaat actccagtat ttgccataaa gaaaaaagac agtactaaat 2760

ggagaaaatt agtagatttc agagaactta ataagagaac tcaagatttc tgggaagttc 2820

aattaggaat accacatcct gcagggttaa aacagaaaaa atcagtaaca gtactggatg 2880

tgggcgatgc atatttttca gttcccttag ataaagactt caggaagtat actgcattta 2940

ccatacctag tataaacaat gagacaccag ggattagata tcagtacaat gtgcttccac 3000

agggatggaa aggatcacca gcaatattcc agtgtagcat gacaaaaatc ttagagcctt 3060

ttagaaaaca aaatccagac atagtcatct atcaatacat ggatgatttg tatgtaggat 3120

ctgacttaga aatagggcag catagaacaa aaatagagga actgagacaa catctgttga 3180

ggtggggatt taccacacca gacaaaaaac atcagaaaga acctccattc ctttggatgg 3240

gttatgaact ccatcctgat aaatggacag tacagcctat agtgctgcca gaaaaggaca 3300

gctggactgt caatgacata cagaaattag tgggaaaatt gaattgggca agtcagattt 3360

atgcagggat taaagtaagg caattatgta aacttcttag gggaaccaaa gcactaacag 3420

aagtagtacc actaacagaa gaagcagagc tagaactggc agaaaacagg gagattctaa 3480

aagaaccggt acatggagtg tattatgacc catcaaaaga cttaatagca gaaatacaga 3540

agcaggggca aggccaatgg acatatcaaa tttatcaaga gccatttaaa aatctgaaaa 3600

caggaaaata tgcaagaatg aagggtgccc acactaatga tgtgaaacaa ttaacagagg 3660

cagtacaaaa aatagccaca gaaagcatag taatatgggg aaagactcct aaatttaaat 3720

tacccataca aaaggaaaca tgggaagcat ggtggacaga gtattggcaa gccacctgga 3780

ttcctgagtg ggagtttgtc aatacccctc ccttagtgaa gttatggtac cagttagaga 3840

aagaacccat aataggagca gaaactttct atgtagatgg ggcagccaat agggaaacta 3900

aattaggaaa agcaggatat gtaactgaca gaggaagaca aaaagttgtc cccctaacgg 3960

acacaacaaa tcagaagact gagttacaag caattcatct agctttgcag gattcgggat 4020

tagaagtaaa catagtgaca gactcacaat atgcattggg aatcattcaa gcacaaccag 4080

ataagagtga atcagagtta gtcagtcaaa taatagagca gttaataaaa aaggaaaaag 4140

tctacctggc atgggtacca gcacacaaag gaattggagg aaatgaacaa gtagatgggt 4200

tggtcagtgc tggaatcagg aaagtactat ttttagatgg aatagataag gcccaagaag 4260

aacatgagaa atatcacagt aattggagag caatggctag tgattttaac ctaccacctg 4320

tagtagcaaa agaaatagta gccagctgtg ataaatgtca gctaaaaggg gaagccatgc 4380

atggacaagt agactgtagc ccaggaatat ggcagctaga ttgtacacat ttagaaggaa 4440

aagttatctt ggtagcagtt catgtagcca gtggatatat agaagcagaa gtaattccag 4500

cagagacagg gcaagaaaca gcatacttcc tcttaaaatt agcaggaaga tggccagtaa 4560

aaacagtaca tacagacaat ggcagcaatt tcaccagtac tacagttaag gccgcctgtt 4620

ggtgggcggg gatcaagcag gaatttggca ttccctacaa tccccaaagt caaggagtaa 4680

tagaatctat gaataaagaa ttaaagaaaa ttataggaca ggtaagagat caggctgaac 4740

atcttaagac agcagtacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga 4800

ttggggggta cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta 4860

aagaattaca aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca 4920

gagatccagt ttggaaagga ccagcaaagc tcctctggaa aggtgaaggg gcagtagtaa 4980

tacaagataa tagtgacata aaagtagtgc caagaagaaa agcaaagatc atcagggatt 5040

atggaaaaca gatggcaggt gatgattgtg tggcaagtag acaggatgag gattaacaca 5100

tggaaaagat tagtaaaaca ccatatgtat atttcaagga aagctaagga ctggttttat 5160

agacatcact atgaaagtac taatccaaaa ataagttcag aagtacacat cccactaggg 5220

gatgctaaat tagtaataac aacatattgg ggtctgcata caggagaaag agactggcat 5280

ttgggtcagg gagtctccat agaatggagg aaaaagagat atagcacaca agtagaccct 5340



gacctagcag accaactaat tcatctgcac tattttgatt gtttttcaga atctgctata 5400 

agaaatacca tattaggacg tatagttagt cctaggtgtg aatatcaagc aggacataac 5460 

aaggtaggat ctctacagta cttggcacta gcagcattaa taaaaccaaa acagataaag 5520 

ccacctttgc ctagtgttag gaaactgaca gaggacagat ggaacaagcc ccagaagacc 5580 

aagggccaca gagggagcca tacaatgaat ggacactaga gcttttagag gaacttaaga 5640 

gtgaagctgt tagacatttt cctaggatat ggctccataa cttaggacaa catatctatg 5700 

aaacttacgg ggatacttgg gcaggagtgg aagccataat aagaattctg caacaactgc 5760 

tgtttatcca tttcagaatt gggtgtcgac atagcagaat aggcgttact cgacagagga 5820 

gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag 5880 

cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt 5940 

ttcatgacaa aagccttagg catctcctat ggcaggaaga agcggagaca gcgacgaaga 6000 

gctcatcaga acagtcagac tcatcaagct tctctatcaa agcagtaagt agtacatgta 6060 

atgcaaccta taatagtagc aatagtagca ttagtagtag caataataat agcaatagtt 6120 

gtgtggtcca tagtaatcat agaatatagg aaaatattaa gacaaagaaa aatagacagg 6180 

ttaattgata gactaataga aagagcagaa gacagtggca atgagagtga aggagaagta 6240 

tcagcacttg tggagatggg ggtggaaatg gggcaccatg ctccttggga tattgatgat 6300 

ctgtagtgct acagaaaaat tgtgggtcac agtctattat ggggtacctg tgtggaagga 6360 

agcaaccacc actctatttt gtgcatcaga tgctaaagca tatgatacag aggtacataa 6420

tgtttgggcc acacatgcct gtgtacccac agaccccaac ccacaagaag tagtattggt 6480 

aaatgtgaca gaaaatttta acatgtggaa aaatgacatg gtagaacaga tgcatgagga 6540 

tataatcagt ttatgggatc aaagcctaaa gccatgtgta aaattaaccc cactctgtgt 6600 

tagtttaaag tgcactgatt tgaagaatga tactaatacc aatagtagta gcgggagaat 6660 

gataatggag aaaggagaga taaaaaactg ctctttcaat atcagcacaa gcataagaga 6720 

taaggtgcag aaagaatatg cattctttta taaacttgat atagtaccaa tagataatac 6780 

cagctatagg ttgataagtt gtaacacctc agtcattaca caggcctgtc caaaggtatc 6840 

ctttgagcca attcccatac attattgtgc cccggctggt tttgcgattc taaaatgtaa 6900 

taataagacg ttcaatggaa caggaccatg tacaaatgtc agcacagtac aatgtacaca 6960 

tggaatcagg ccagtagtat caactcaact gctgttaaat ggcagtctag cagaagaaga 7020 

tgtagtaatt agatctgcca atttcacaga caatgctaaa accataatag tacagctgaa 7080

cacatctgta gaaattaatt gtacaagacc caacaacaat acaagaaaaa gtatccgtat 7140 

ccagagggga ccagggagag catttgttac aataggaaaa ataggaaata tgagacaagc 7200 

acattgtaac attagtagag caaaatggaa tgccacttta aaacagatag ctagcaaatt 7260 

aagagaacaa tttggaaata ataaaacaat aatctttaag caatcctcag gaggggaccc 7320 

agaaattgta acgcacagtt ttaattgtgg aggggaattt ttctactgta attcaacaca 7380 

actgtttaat agtacttggt ttaatagtac ttggagtact gaagggtcaa ataacactga 7440 

aggaagtgac acaatcacac tcccatgcag aataaaacaa tttataaaca tgtggcagga 7500

agtaggaaaa gcaatgtatg cccctcccat cagtggacaa attagatgtt catcaaatat 7560 

tactgggctg ctattaacaa gagatggtgg taataacaac aatgggtccg agatcttcag 7620 

acctggagga ggcgatatga gggacaattg gagaagtgaa ttatataaat ataaagtagt 7680 

aaaaattgaa ccattaggag tagcacccac caaggcaaag agaagagtgg tgcagagaga 7740 

aaaaagagca gtgggaatag gagctttgtt ccttgggttc ttgggagcag caggaagcac 7800 

tatgggctgc acgtcaatga cgctgacggt acaggccaga caattattgt ctgatatagt 7860 

gcagcagcag aacaatttgc tgagggctat tgaggcgcaa cagcatctgt tgcaactcac 7920 

agtctggggc atcaaacagc tccaggcaag aatcctggct gtggaaagat acctaaagga 7980 

tcaacagctc ctggggattt ggggttgctc tggaaaactc atttgcacca ctgctgtgcc 8040 

ttggaatgct agttggagta ataaatctct ggaacagatt tggaataaca tgacctggat 8100 

ggagtgggac agagaaatta acaattacac aagcttaata cactccttaa ttgaagaatc 8160 

gcaaaaccag caagaaaaga atgaacaaga attattggaa ttagataaat gggcaagttt 8220 

gtggaattgg tttaacataa caaattggct gtggtatata aaattattca taatgatagt 8280 

aggaggcttg gtaggtttaa gaatagtttt tgctgtactt tctatagtga atagagttag 8340 

gcagggatat tcaccattat cgtttcagac ccacctccca atcccgaggg gacccgacag 8400 

gcccgaagga atagaagaag aaggtggaga gagagacaga gacagatcca ttcgattagt 8460 

gaacggatcc ttagcactta tctgggacga tctgcggagc ctgtgcctct tcagctacca 8520 

ccgcttgaga gacttactct tgattgtaac gaggattgtg gaacttctgg gacgcagggg 8580 

gtgggaagcc ctcaaatatt ggtggaatct cctacagtat tggagtcagg aactaaagaa 8640 

tagtgctgtt aacttgctca atgccacagc catagcagta gctgagggga cagatagggt 8700 

tatagaagta ttacaagcag cttatagagc tattcgccac atacctagaa gaataagaca 8760 



gggcttggaa aggattttgc tataagatgg gtggcaagtg gtcaaaaagt agtgtgattg 8820

gatggcctgc tgtaagggaa agaatgagac gagctgagcc agcagcagat ggggtgggag 8880

cagtatctcg agacctagaa aaacatggag caatcacaag tagcaataca gcagctaaca 8940

atgctgcttg tgcctggcta gaagcacaag aggaggaaga ggtgggtttt ccagtcacac 9000

ctcaggtacc tttaagacca atgacttaca aggcagctgt agatcttagc cactttttaa 9060

aagaaaaggg gggactggaa gggctaattc actcccaaag aagacaagat atccttgatc 9120

tgtggatcta ccacacacaa ggctacttcc ctgattggca gaactacaca ccagggccag 9180

gggtcagata tccactgacc tttggatggt gctacaagct agtaccagtt gagccagata 9240

aggtagaaga ggccaataaa ggagagaaca ccagcttgtt acaccctgtg agcctgcatg 9300

gaatggatga ccctgagaga gaagtgttag agtggaggtt tgacagccgc ctagcatttc 9360

atcacgtggc ccgagagctg catccggagt acttcaagaa ctgctgacat cgagcttgct 9420

acaagggact ttccgctggg gactttccag ggaggcgtgg cctgggcggg actggggagt 9480

ggcgagccct cagatgctgc atataagcag ctgctttttg cctgtactgg gtctctctgg 9540

ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 9600

caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 9660

aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagca 9709



<210> 2 
<211> 854 
<212> PRT 

<213> Human immunodeficiency virus 
<220> 

<223> Envelope Polyprotein 
<400> 2 

Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Lys
1 5 10 15

Trp Gly Thr Met Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Thr Glu
20 25 30

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala
35 40 45

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
50 55 60

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn
65 70 75 80

Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp
85 90 95

Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp
100 105 110

Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser
115 120 125

Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser
130 135 140

Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn
145 150 155 160

Ile Ser Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr Ala Phe Phe
165 170 175

Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn Thr Ser Tyr Arg Leu Ile
180 185 190

Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
210 215 220

Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val
225 230 235 240

Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255

Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Val Ile Arg Ser
260 265 270

Ala Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Thr
275 280 285

Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
290 295 300

Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile Gly Lys
305 310 315 320

Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp
325 330 335

Asn Ala Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln Phe Gly
340 345 350

Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp Pro Glu
355 360 365

Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
370 375 380

Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr
385 390 395 400

Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu Pro Cys
405 410 415

Arg Ile Lys Gln Phe Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
420 425 430

Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr
435 440 445

Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu
450 455 460

Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
465 470 475 480

Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro
485 490 495

Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly
500 505 510

Ile Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met
515 520 525

Gly Cys Thr Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser
530 535 540

Asp Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln
545 550 555 560

Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala
565 570 575

Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly
580 585 590

Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp
595 600 605

Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gln Ile Trp Asn Asn Met
610 615 620

Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile
625 630 635 640

His Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln
645 650 655

Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn
660 665 670

Ile Thr Asn Trp Leu Trp Tyr Ile Lys Leu Phe Ile Met Ile Val Gly
675 680 685

Gly Leu Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val Asn
690 695 700

Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro
705 710 715 720

Ile Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly
725 730 735

Glu Arg Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser Leu Ala
740 745 750

Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg
755 760 765

Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly
770 775 780

Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gln Tyr
785 790 795 800

Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Asn Leu Leu Asn Ala Thr
805 810 815

Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Leu Gln
820 825 830

Ala Ala Tyr Arg Ala Ile Arg His Ile Pro Arg Arg Ile Arg Gln Gly
835 840 845

Leu Glu Arg Ile Leu Leu
850



<210> 3 
<211> 107 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<400> 3 

aagatgtagt aattagatct gccaatttca cagacaatgc taaaaccata atagtacagc 60 
tgaacacatc gttagaaatt aattgtacaa gacccaacaa caataca 107 



<210> 4 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<220> 

<221> misc_feature
<222> (97)..(99)

<223> Sequence at this position: (GA)(AT)(GATC), i.e.
base at position 97 can be G or A, base at
position 98 can be A or T, and base at
position 99 can be G, A, T or C.
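
The degenerate positions described in the feature above can be enumerated with a short script. This is only an illustrative sketch: the helper name `expand` and the reduced `IUPAC` table are assumptions introduced here, and the base order inside each entry simply follows the order given in the <223> text.

```python
from itertools import product

# IUPAC codes that occur at positions 97-99 of SEQ ID NO: 4 ("rwn"),
# plus the four plain bases. Table and helper name are illustrative only.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ga",    # G or A
    "w": "at",    # A or T
    "n": "gatc",  # G, A, T or C
}

def expand(motif):
    """Return every concrete sequence encoded by a degenerate motif."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in motif))]

variants = expand("rwn")
print(len(variants))   # 2 * 2 * 4 combinations
print(variants)
```

Under these assumptions the oligonucleotide pool covers 2 x 2 x 4 = 16 different triplets at positions 97 to 99.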

<400> 4 

ttttgctcta gaaatgttac aatgtgcttg tcttatgtct cctgttgcag cttctgttgc 60 
atgaaatgct ctccctggtc cgatatggat actatgrwnt tttcttgtat tgttgttggg 120 



<210> 5 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence:
sequencing primer 

<400> 5 

ccatgtacaa atgtcag 



<210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence:
sequencing primer 

<400> 6 

aaaactgtgc gttacaa 



<210> 7 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence:
sequencing primer 

<400> 7 

gtaaaacgac ggccagt 



<210> 8 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence:
sequencing primer 

<400> 8 

caggaaacag ctatgac 



<210> 9 
<211> 2148 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: synthetic DNA 



<220> 

<221> misc_feature

<222> (3)..(9)

<223> BstEII cleavage site 



<220> 

<221> misc_feature 
<222> (2143)..(2148)
<223> BamHI cleavage site 
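
The <222> ranges in this listing are 1-based and inclusive. A minimal sketch of how such a range maps onto a string slice is given below; the helper `feature` and its `offset` parameter are hypothetical names introduced for illustration, and the two excerpts are the first ten and last eight bases of SEQ ID NO: 9 as listed under <400> 9.

```python
def feature(seq, start, end, offset=1):
    """Slice a 1-based, inclusive <222> range out of seq.

    offset is the 1-based position of seq[0] in the full sequence,
    so the helper also works on an excerpt of a longer sequence.
    """
    return seq[start - offset:end - offset + 1]

head = "tgggtcaccg"  # positions 1-10 of SEQ ID NO: 9
tail = "acggatcc"    # positions 2141-2148 of SEQ ID NO: 9

bsteii = feature(head, 3, 9)
bamhi = feature(tail, 2143, 2148, offset=2141)
print(bsteii, bamhi)  # ggtcacc ggatcc
```

The two slices agree with the known recognition sequences GGTNACC (BstEII) and GGATCC (BamHI).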



<400> 9 

tgggtcaccg tctattatgg ggtgcctgtg tggaaggaag caaccaccac tctattttgt 60
gcatcagatg ctaaagcata tgatacagag gtacataatg tttgggccac acatgcctgt 120
gtacccacag accccaaccc acaagaagta gtattggtaa atgtgacaga aaattttaac 180
atgtggaaaa atgacatggt agaacagatg catgaggata taatcagttt atgggatcaa 240
agccttaagc catgtgtaaa attaacccca ctctgtgtta gtttaaagtg cactgatttg 300
aagaatgata ctaataccaa tagtagtagc gggagaatga taatggagaa aggagagata 360
aaaaactgca gcttcaatat cagcacaagc ataagagata aggtgcagaa agaatatgca 420
ttcttttata aacttgatat agtaccaata gataatacca gctataggtt gataagttgt 480
aacacctcag tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 540
tattgtgccc cggctggttt tgcgattcta aaatgtaata ataagacgtt caatggaaca 600
ggaccatgta caaatgtcag cacagtacaa tgtacacatg gaattcgacc agtagtatca 660
actcaactgc tgttaaatgg cagtctagca gaagaagatg tagtaattag atctgccaat 720
ttcacagaca atgctaaaac cataatagta cagctgaaca catctgtaga aattaattgt 780
acaagaccca acaacaatac aagaaaaagt atccgtatcc agaggggacc agggagagca 840
tttgttacaa taggaaaaat aggaaatatg agacaagcac attgtaacat ttctagagca 900
aaatggaatg ccactttaaa acagatagct agcaaattaa gagaacaatt tggaaataat 960
aaaacaataa tctttaagca gtcatccgga ggggacccag aaattgtaac gcacagtttt 1020
aattgtggag gggaattttt ctactgtaat tcaacacaac tgtttaatag tacttggttt 1080
aatagtactt ggagtactga agggtcaaat aacactgaag gaagtgacac aatcacactc 1140
ccatgcagaa taaaacaatt tataaacatg tggcaggaag taggaaaagc aatgtatgcc 1200
cctcccatca gtggccaaat tagatgttca tcaaatatta ctgggctgct attaactcga 1260
gatggtggta ataacaacaa tgggtccgag attttcagac ctggaggagg cgatatgagg 1320
gataattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 1380
gcacccacca aggcaaagag acgcgtggtg cagagagaaa agcgcgcagt gggaatagga 1440
gctctgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 1500
ctgacggtac aggccagaca attattgtct gatatagtgc agcagcagaa caatttgctg 1560
agggcaattg aggcgcaaca gcatctgttg caactcacag tctggggcat caaacagctc 1620
caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 1680
ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 1740
aaatctctgg aacagatttg gaataacatg acctggatgg agtgggacag agaaattaac 1800
aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 1860
gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 1920
aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 1980
atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 2040
tttcagaccc acctcccaat cccgagggga cccgacaggc ccgaaggaat agaagaagaa 2100
ggtggagaga gagacagaga cagatccatt cgattagtga acggatcc 2148



<210> 10 
<211> 6229 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: synthetic DNA 
<220>

<221> sig_peptide
<222> (1293)..(1295)
<223> env ATG

<220>

<221> misc_feature
<222> (1377)..(1379)
<223> env AGT, gp120 start

<220>

<221> misc_feature
<222> (1397)..(1403)
<223> BstEII cleavage site

<220>

<221> misc_feature
<222> (3537)..(3542)
<223> BamHI cleavage site

<220>

<221> misc_feature
<222> (3855)..(3857)
<223> env TAA stop
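
As a consistency check on the annotated env start codon, the bases following position 1293 can be translated and compared with the first residues of SEQ ID NO: 2. The sketch below assumes the standard genetic code; `translate`, `CODON_TABLE`, and `env_start` are illustrative names, and the excerpt is positions 1293 to 1319 as listed under <400> 10.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").lower()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Positions 1293-1319 of SEQ ID NO: 10, starting at the annotated env ATG.
env_start = "atgagagtg aaggagaag tatcagcac"
print(translate(env_start))  # MRVKEKYQH
```

The result corresponds to Met Arg Val Lys Glu Lys Tyr Gln His, the first nine residues listed for SEQ ID NO: 2.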

<400> 10 

ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 
ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 
ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat 180 
ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 240 
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 300 
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 360 
tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 420
ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc cattcgccat tcaggctgcg 480 
caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg 540 
gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg 600 
taaaacgacg gccagtgagc gtctagttat taatagtaat caattacggg gtcattagtt 660 
catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga 720 
ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca 780 
atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca 840 
gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg 900 
cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc 960 
tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat caatgggcgt 1020
ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt caatgggagt 1080 
ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc cgccccattg 1140 
acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc tcgtttagtg 1200
aaccgtcaga tcgcctggag acgccatcca cgctgttttg acctccatag aagacaccgg 1260 
gacaattcga gctcggtacc gtcgacgcca ccatgagagt gaaggagaag tatcagcact 1320 
tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg atctgtagtg 1380 
ctacagaaaa attgtgggtc accgtctatt atggggtacc tgtgtggaag gaagcaacca 1440 
ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat aatgtttggg 1500 



ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg gtaaatgtga 1560
cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag gatataatca 1620
gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt gttagtttaa 1680
agtgcactga tttgaagaat gatactaata ccaatagtag tagcgggaga atgataatgg 1740
agaaaggaga gataaaaaac tgctctttca atatcagcac aagcataaga gataaggtgc 1800
agaaagaata tgcattcttt tataaacttg atatagtacc aatagataat accagctata 1860
ggttgataag ttgtaacacc tcagtcatta cacaggcctg tccaaaggta tcctttgagc 1920
caattcccat acattattgt gccccggctg gttttgcgat tctaaaatgt aataataaga 1980
cgttcaatgg aacaggacca tgtacaaatg tcagcacagt acaatgtaca catggaatca 2040
ggccagtagt atcaactcaa ctgctgttaa atggcagtct agcagaagaa gatgtagtaa 2100
ttagatctgc caatttcaca gacaatgcta aaaccataat agtacagctg aacacatctg 2160
tagaaattaa ttgtacaaga cccaacaaca atacaagaaa aagtatccgt atccagaggg 2220
gaccagggag agcatttgtt acaataggaa aaataggaaa tatgagacaa gcacattgta 2280
acattagtag agcaaaatgg aatgccactt taaaacagat agctagcaaa ttaagagaac 2340
aatttggaaa taataaaaca ataatcttta agcaatcctc aggaggggac ccagaaattg 2400
taacgcacag ttttaattgt ggaggggaat ttttctactg taattcaaca caactgttta 2460
atagtacttg gtttaatagt acttggagta ctgaagggtc aaataacact gaaggaagtg 2520
acacaatcac actcccatgc agaataaaac aatttataaa catgtggcag gaagtaggaa 2580
aagcaatgta tgcccctccc atcagtggac aaattagatg ttcatcaaat attactgggc 2640
tgctattaac aagagatggt ggtaataaca acaatgggtc cgagatcttc agacctggag 2700
gaggcgatat gagggacaat tggagaagtg aattatataa atataaagta gtaaaaattg 2760
aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga gaaaaaagag 2820
cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc actatgggct 2880
gcacgtcaat gacgctgacg gtacaggcca gacaattatt gtctgatata gtgcagcagc 2940
agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc acagtctggg 3000
gcatcaaaca gctccaggca agaatcctgg ctgtggaaag atacctaaag gatcaacagc 3060
tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg ccttggaatg 3120
ctagttggag taataaatct ctggaacaga tttggaataa catgacctgg atggagtggg 3180
acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa tcgcaaaacc 3240
agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt ttgtggaatt 3300
ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata gtaggaggct 3360
tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt aggcagggat 3420
attcaccatt atcgtttcag acccacctcc caatcccgag gggacccgac aggcccgaag 3480
gaatagaaga agaaggtgga gagagagaca gagacagatc cattcgatta gtgaacggat 3540
ccttagcact tatctgggac gatctgcgga gcctgtgcct cttcagctac caccgcttga 3600
gagacttact cttgattgta acgaggattg tggaacttct gggacgcagg gggtgggaag 3660
ccctcaaata ttggtggaat ctcctacagt attggagtca ggaactaaag aatagtgctg 3720
ttaacttgct caatgccaca gccatagcag tagctgaggg gacagatagg gttatagaag 3780
tattacaagc agcttataga gctattcgcc acatacctag aagaataaga cagggcttgg 3840
aaaggatttt gctataagat gggtggcaag tggtcaaaaa gtagtgtgat tggatggcct 3900
gctgtaaggg aaagaatgag acgagctgag ccagcagcag atggggtggg agcagtatct 3960
cgagatctag actagaacta gcttcgatcc agacatgata agatacattg atgagtttgg 4020
acaaaccaca actagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat 4080
tgctttattt gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca 4140
ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt aaaacctcta 4200
caaatgtggt atggctgatt atgatcctgc ctcgcgcgtt tcggtgatga cggtgaaaac 4260
ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga tgccgggagc 4320
agacaagccc gtcagggcgc gtcagcgggt gttggcgggt gtcggggcgc agccatgacc 4380
cagtcacgta gcgatagcgg agtgtatact ggcttaacta tgcggcatca gagcagattg 4440
tactgagagt gcaccatatg tcgggccgcg ttgctggcgt ttttccatag gctccgcccc 4500
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 4560
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 4620
ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc 4680
tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 4740
gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 4800
ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 4860
aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 4920



aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 4980
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 5040
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 5100
gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 5160
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 5220
gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 5280
tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 5340
gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 5400
ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 5460
actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 5520
ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg 5580
tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 5640
cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 5700
ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 5760
ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 5820
tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 5880
agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 5940
atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 6000
gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 6060
aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 6120
tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 6180
aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccac 6229





<210> 11
<211> 860
<212> DNA

<213> Human immunodeficiency virus

<220>

<221> misc_feature
<222> (1)..(860)
<223> PI-932 original sequence V1-V2



<400> 11 

tgtgtaccca cagaccccaa cccacaaaag gtagtattgg aaaatgtgac agaaaatttt 60
aacatgtgga aaaatgacat ggtagaacag atgcatgagg atataatcaa tttatgggat 120
caaagcctaa agccatgtgt aaaactaacc ccactctgtg ttactttaaa ttgcactgat 180
gctgatttaa attgcaataa tactgattta aattgcacta aagctaattt ggggaaaaat 240
actcataaca atactattag tgggaaaata atagagaaag tagaaataaa aaactgctct 300
ttcaaggtca ccacaggcat aagggataag atgcaaaaag aatatgcact tttgaataaa 360
cttgatatag taccaataga taatgataag aataatacta actttatatt gataagttgt 420
aacacctcga ccattacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 480
ttttgtgccc cggctggttt tgcgattcta aagtgtaatg aaaagagtta cagtggaaaa 540
ggaccatgta aaaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtgtca 600
actcaactgc tgttgaatgg cagtctagca gaaaaagaag tagtaattag atctgagaat 660
ttcacagaca atgctaaaac cataatagta cagctgaagg aatctgtaaa cattacttgt 720
ataagacccc acaacactgt aacagacagg atacatatag ggccagggag atcatttcat 780
acaacaagaa aaataaaagg agatataaga caagcacatt gtagccttag gagaaaagat 840
tggaataaca ctttacaaga 860



<210> 12 
<211> 870 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: PI-932
gene cassette, comprising the cleavage sites for
restriction enzymes BspTI, PstI, BclI, EcoRI,
BglII, PvuII, XbaI, NheI



<400> 12 

tgtgtaccca cagaccccaa cccacaaaag gtagtattgg aaaatgtgac agaaaatttt 60
aacatgtgga aaaatgacat ggtagaacag atgcatgagg atataatcaa tttatgggat 120
caaagcctta agccatgtgt aaaactaacc ccactctgtg ttactttaaa ttgcactgat 180
gctgatttaa attgcaataa tactgattta aattgcacta aagctaattt ggggaaaaat 240
actcataact gcagtattag tgggaaaata atagagaaag tagaaataaa aaactgctct 300
ttcaaggtca ccacaggcat aagggataag atgcaaaaag aatatgcact tttgaataaa 360
cttgatatag taccaataga taatgataag aataatacta actttatatt gataagttgt 420
aacacctcgg tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 480
ttttgtgccc cggctggttt tgcgattcta aagtgtaatg aaaagagtta cagtggaaaa 540
ggaccatgta aaaatgtcag cacagtacaa tgtacacatg gaattcggcc agtagtgtca 600
actcaactgc tgttgaatgg cagtctagca gaaaaagaag tagtaattag atctgagaat 660
ttcacagaca atgctaaaac cataatagta cagctgaagg aatctgtaaa cattacttgt 720
ataagacccc acaacactgt aacagacagg atacatatag ggccagggag atcatttcat 780
acaacaagaa aaataaaagg agatataaga caagcacatt gtagcctttc tagaaaagat 840
tggaataaca ctttacaaga gatagctagc 870



